SEQUENCE LISTING 



<110> Pompejus, Markus / v 

Seulberger, Harald / 
Hoeffken, Hans Wolfgang / 
Doval, Jose Luis Revuelta 
Jimenez, Alberto 
Garcia, Maria Angeles Santos 




<120> Phosphoriboxyl-Pyrophosphate Synthetase Polypeptide 

<130> PF48687-2/DP 

<140> US 10/076,157 
<141> 2002-02-15 

<150> Germany, 19757755.5 
<151> 1997-12-23 

<160> 21 

<170> WordPerfect 8 

<210> 1 
<211> 1911 
<212> DNA 

<213> Ashbya gossypii 
<400> 1 

GGTAGTCGCT CAT CG AC AG A CACAATCGCG TGTTCTCTCT GAATCGTCCA TTGGGTGTCA 60 

GCATCCTGAT CGCGGGCGGA TGGAATGGGT AATCATTAGG AAACACCAAT GTCCCATGGT 120 

ATTGTCCGTC CTCGTATGGT GTCTCAGGAG GACCCGTGAT CACGTAGTGC CACACCAGGA 180 

TATTGTCTTC CTTTGGTGCT GCCACGATGT AGGGCGGGGG GTTCTCGGTC ATCATTTTGT 24 0 

ACTCCTTTGA GAGCCGCTTG TACGCCTGTC TTGATGCCAT CTTGCCTACT ATTAGTTTCT 300 

CACCACTTCC CGCCAAACAA TCTGCACTTT ACGAGCGCTA TCTATCCCTC GGGTCGCTCT 360 

AGTTGATTAT TGGCGAAACT GATAGTTCAG GTACTTCCAT GATGCGGTCA TATCCACGTA 4 20 

TGTGATCACG T GAT CATC AG CCATGCTGCC AGCTCACGGG CCTGCCTACA CTATTGGAGG 4 80 

CTCTGTGAGT CATGATTTAT TGCATATCAA GCCCAGATAG TCGTTGGGGA TACTACCGTT 54 0 

GCCGCGATGA GCTCCGATAT TAAGTTGTAG CCAAAAATTT TAACGGATGA CTTCTTAACA 600 

GTTATTGACG CCGCAATCCT ACGCC ATG TCG TCC AAT AGC ATA AAG CTG CTA 652 



Met Ser Ser Asn Ser lie Lys Leu Leu 
1 5 



GCA GGT AAC TCG CAC CCG GAC CTA GCT GAG AAG GTC TCC GTT CGC CTA 
Ala Gly Asn Ser His Pro Asp Leu Ala Glu Lys Val Ser Val Arg Leu 
10 ~ 15 " 20 25 



700 



GGT GTA CCA CTT TCG AAG ATT GGA GTG TAT CAC TAC TCT AAC AAA GAG 
Gly Val Pro Leu Ser Lys lie Gly Val Tyr His Tyr Ser Asn Lys Glu 



748 



30 35 40 

ACG TCA GTT ACT ATC GGC GAA AGT ATC CGT GAT GAA GAT GTC TAC ATC 7 96 

Thr Ser Val Thr He Gly Glu Ser He Arg Asp Glu Asp Val Tyr He 
45 50 55 

ATC CAG ACA GGA ACG GGG GAG CAG GAA ATC AAC GAC TTC CTC ATG GAA 84 4 

He Gin Thr Gly Thr Gly Glu Gin Glu He Asn Asp Phe Leu Met Glu 
60 65 70 

CTG CTC ATC ATG ATC CAT GCC TGC CGG TCA GCC TCT GCG CGG AAG ATC 892 
Leu Leu He Met He His Ala Cys Arg Ser Ala Ser Ala Arg Lys He 
75 80 85 

ACA GCG GTT ATA CCA AAC TTC CCT TAC GCA AGA CAA GAC AAA AAG GAC 94 0 

Thr Ala Val He Pro Asn Phe Pro Tyr Ala Arg Gin Asp Lys Lys Asp 
90 95 100 105 

AAG TCG CGA GCA CCG ATA ACT GCC AAG CTG GTG GCC AAG ATG CTA GAG 988 
Lys Ser Arg Ala Pro He Thr Ala Lys Leu Val Ala Lys Met Leu Glu 
110 115 120 

ACC GCG GGG TGC AAC CAC GTT ATC ACG ATG GAT TTG CAC GCG TCT CAA 1036 
Thr Ala Gly Cys Asn His Val He Thr Met Asp Leu His Ala Ser Gin 
125 130 135 

ATT CAG GGT TTC TTC CAC ATT CCA GTG GAC AAC CTA TAT GCA GAG CCG 1084 
He Gin Gly Phe Phe His He Pro Val Asp Asn Leu Tyr Ala Glu Pro 
140 145 150 

AAC ATC CTG CAC TAC ATC CAA CAT AAT GTG GAC TTC CAG AAT AGT ATG 1132 
Asn He Leu His Tyr He Gin His Asn Val Asp Phe Gin Asn Ser Met 
155 " 160 165 

TTG GTC GCG CCA GAC GCG GGG TCG GCG AAG CGC ACG TCG ACG CTT TCG 1180 
Leu Val Ala Pro Asp Ala Gly Ser Ala Lys Arg Thr Ser Thr Leu Ser 
170 175 180 185 

GAC AAG CTG AAT CTC AAC TTC GCG TTG ATC CAC AAA GAA CGG CAG AAG 1228 
Asp Lys Leu Asn Leu Asn Phe Ala Leu He His Lys Glu Arg Gin Lys 
190 195 200 

GCG AAC GAG GTC TCG CGG ATG GTG TTG GTG GGT GAT GTC GCC GAC AAG 127 6 

Ala Asn Glu Val Ser Arg Met Val Leu Val Gly Asp Val Ala Asp Lys 
205 210 215 

TCC TGT ATT ATT GTA GAC GAC ATG GCG GAC ACG TGC GGA ACG CTA GTG 1324 
Ser Cys He He Val Asp Asp Met Ala Asp Thr Cys Gly Thr' Leu Val 
220 225 230 

AAG GCC ACT GAC ACG CTG ATC GAA AAT TGT GCG AAA GAA GTG ATT GCC 137 2 

Lys Ala Thr Asp Thr Leu He Glu Asn Cys Ala Lys Glu Val He Ala 
235 240 245 

ATT GTG ACA CAC GGT ATA TTT TCT GGC GGC GCC CGC GAG AAG TTG CGC 1420 
He Val Thr His Gly He Phe Ser Gly Gly Ala Arg Glu Lys Leu Arg 
250 255 260 265 

AAC AGC AAG CTG GCA CGG ATC GTA AGC ACA AAT ACG GTG CCA GTG GAC 14 68 

Asn Ser Lys Leu Ala Arg He Val Ser Thr Asn Thr Val Pro Val Asp 



270 



275 



280 



CTC AAT CTA GAT ATC TAC CAC CAA ATT GAC ATT AGT GCC ATT TTG GCC 1516 
Leu Asn Leu Asp He Tyr His Gin He Asp He Ser Ala He Leu Ala 
285 290 295 

GAG GCA ATT AGA AGG CTT CAC AAC GGG GAA AGT GTG TCG TAC CTG TTC 1564 
Glu Ala He Arg Arg Leu His Asn Gly Glu Ser Val Ser Tyr Leu Phe 
300 " 305 310 

AAT AAC GCT GTC ATG TAGTGCTGTC AGTGGCAGAT GCATGATCGC TGGCCTAATT 1619 
Asn Asn Ala Val Met 
315 



ATCTGTGTAA 


GTTGATACAA 


TGCAGTAAAT 


ACAGTACATA 


AAACTGAATG 


TTTTTCACTT 


1679 


AGGGGTGCTT 


TGTTGTTCTG 


ATAGCGTGTG 


TGCGAATTTG 


GAGGTGAAAG 


TTGAACATCA 


1739 


CGTAATGAAT 


ACAAACAAGA 


TTGCACATTA 


GGAAAAGCGA 


TAAATTATTT 


ATTATTTGCA 


1799 


ACTGGCCTTT 


GAGCGTTTAA 


GCCTGAACAT 


TTTTGCCCTT 


TTGTTTGACC 


GTACCGTTAT 


1859 


CACTCGTCCT 


TATATATGGC 


TATCCTTCTC 


TTCCGGAACT 


TCTTCGAGCG 


TA 


1911 



<210> 2 
<211> 318 
<212> PRT 

<213> Ashbya gossypii 
<400> 2 

Met Ser Ser Asn Ser He Lys Leu Leu Ala Gly Asn Ser His Pro Asp 
1 5 10 15 

Leu Ala Glu Lys Val Ser Val Arg Leu Gly Val Pro Leu Ser Lys He 
20 25 30 

Gly Val Tyr His Tyr Ser Asn Lys Glu Thr Ser Val Thr He Gly Glu 
35 40 45 

Ser He Arg Asp Glu Asp Val Tyr He He Gin Thr Gly Thr Gly Glu 
50 55 60 

Gin Glu He Asn Asp Phe Leu Met Glu Leu Leu He Met He His Ala 
65 70 75 80 

Cys Arg Ser Ala Ser Ala Arg Lys He Thr Ala Val He Pro Asn Phe 
85 90 95 

Pro Tyr Ala Arg Gin Asp Lys Lys Asp Lys Ser Arg Ala Pro He Thr 
100 105 110 

Ala Lys Leu Val Ala Lys Met Leu Glu Thr Ala Gly Cys Asn His Val 
115 120 125 

He Thr Met Asp Leu His Ala Ser Gin He Gin Gly Phe Phe His He 
130 ^ 135 140 



Pro Val Asp Asn Leu Tyr Ala Glu Pro Asn He Leu His Tyr He Gin 



145 

His Asn 
Ser Ala 
Ala Leu 



Val Leu 
210 

Met Ala 
225 

Glu Asn 



Ser Gly 



Val Ser 



Gin He 
290 

Asn Gly 
305 



Val Asp Phe 
165 

Lys Arg Thr 
180 

He His Lys 
195 

Val Gly Asp 
Asp Thr Cys 



Cys Ala Lys 
245 

Gly Ala Arg 
260 

Thr Asn Thr 
275 

Asp He Ser 



150 

Gin Asn Ser Met 



Ser Thr 



Glu Arg 



Leu Ser 
185 

Gin Lys 
200 



155 

Leu Val 
170 

Asp Lys 
Ala Asn 



Val Ala 
215 



Asp Lys Ser Cys 
Leu Val 
Glu Val He Ala 



Gly Thr 
230 



Lys Ala 
235 



Glu Ser Val 



Glu Lys 



Val Pro 



Ala He 
295 

Ser Tyr 
310 



Leu Arg 
265 

Val Asp 
280 



He Val 
250 

Asn Ser 



Leu Asn 



Ala Pro Asp 

Leu Asn Leu 
190 

Glu Val Ser 
205 

He He Val 
220 

Thr Asp Thr 
Thr His Gly 



Leu Ala Glu Ala 

Leu Phe Asn Asn 
315 



Lys Leu Ala 
270 

Leu Asp He 
285 

He Arg Arg 
300 

Ala Val Met 



160 

Ala Gly 
175 

Asn Phe 



Arg Met 

Asp Asp 

Leu He 
240 

He Phe 
255 

Arg He 
Tyr His 
Leu His 



<210> 3 
<211> 5369 
<212> DNA 

<213> Ashbya gossypii 
<400> 3 



AAGCTTGACC TTGGCTGGCA CTTGAGTCGG CAGACAGGTG GACTAACCCG AGCA ATG 57 

Met 
1 

GAT CGT GGT TGT AAA GGT ATC TCT TAT GTG CTC AGT GCA ATG GTT TTT 105 
Asp Arg Gly Cys Lys Gly He Ser Tyr Val Leu Ser Ala Met Val Phe 
5 " * 10 15 

CAC ATA ATA CCG ATT ACA TTT GAA ATA TCG ATG GTA TGT GGC ATA TTG 153 
His He He Pro He Thr Phe Glu He Ser Met Val Cys Gly He Leu 
20 25 30 

ACA TAC CAG TTT GGT GCT TCC TTC GCT GCT ATA ACA TTC TCG ACT ATG 201 
Thr Tyr Gin Phe Gly Ala Ser Phe Ala Ala He Thr Phe Ser Thr Met 
35 40 45 

CTT CTT TAC TCC ATC TTT ACT TTC AGA ACG ACG GCG TGG CGC ACA CGG 24 9 

Leu Leu Tyr Ser He Phe Thr Phe Arg Thr Thr Ala Trp Arg Thr Arg 
50 55 60 65 



TTT AGG CGT GAT GCG AAC AAG GCT GAC AAT AAG GCC GCT AGT GTG GCA 
Phe Arg Arg Asp Ala Asn Lys Ala Asp Asn Lys Ala Ala Ser Val Ala 
70 75 80 



297 



TTG GAT TCC CTA ATA AAT TTT GAA GCT GTA AAG TAT TTC AAT AAC GAG 
Leu Asp Ser Leu lie Asn Phe Glu Ala Val Lys Tyr Phe Asn Asn Glu 
85 90 95 



345 



AAG TAC CTT GCG GAC AAG TAT CAC ACA TCC TTG ATG AAG TAC CGG GAT 
Lys Tyr Leu Ala Asp Lys Tyr His Thr Ser Leu Met Lys Tyr Arg Asp 
100 ' ^ 105 110 



393 



TCC CAG ATA AAG GTC TCG CAA TCG CTG GCG TTT TTG AAC ACC GGC CAG 
Ser Gin lie Lys Val Ser Gin Ser Leu Ala Phe Leu Asn Thr Gly Gin 
115 120 125 



441 



AAC CTA ATT TTT ACC ACT GCA CTG ACT GCA ATG ATG TAT ATG GCC TGT 
Asn Leu lie Phe Thr Thr Ala Leu Thr Ala Met Met Tyr Met Ala Cys 
130 135 140 145 



489 



AAT GGT GTT ATG CAG GGC TCT CTT ACA GTG GGG GAT CTT GTG TTA ATT 
Asn Gly Val Met Gin Gly Ser Leu Thr Val Gly Asp Leu Val Leu lie 
150 " 155 160 



537 



AAT CAA CTG GTA TTC CAG CTC TCC GTG CCA CTA AAC TTC CTT GGT AGC 
Asn Gin Leu Val Phe Gin Leu Ser Val Pro Leu Asn Phe Leu Gly Ser 
165 170 175 



585 



GTC TAC CGT GAT CTC AAG CAG TCT CTG ATA GAT ATG GAA TCT TTA TTT 
Val Tyr Arg Asp Leu Lys Gin Ser Leu lie Asp Met Glu Ser Leu Phe 
180 185 ~ 190 



633 



AAA CTG CAA AAA AAT CAG GTC ACA ATT AAG AAC TCC CCA AAT GCC CAG 
Lys Leu Gin Lys Asn Gin Val Thr lie Lys Asn Ser Pro Asn Ala Gin 
195 200 " 205 



681 



AAC CTA CCA ATA CAC AAA CCG TTG GAT ATT CGC TTT GAA AAT GTT ACG 
Asn Leu Pro lie His Lys Pro Leu Asp lie Arg Phe Glu Asn Val Thr 
210 215 220 225 



729 



TTT GGC TAT GAC CCG GAG CGG CGT ATA TTG AAC AAT GTT TCG TTT ACC 
Phe Gly Tyr Asp Pro Glu Arg Arg lie Leu Asn Asn Val Ser Phe Thr 
230 235 240 



777 



ATC CCA GCT GGA ATG AAG ACT GCC ATA GTA GGC CCA TCG GGC TCG GGG 
lie Pro Ala Gly Met Lys Thr Ala lie Val Gly Pro Ser Gly Ser Gly 
245 250 255 



825 



AAG TCC ACC ATT TTG AAG CTC GTA TTT AGA TTC TAT GAG CCC GAG CAA 
Lys Ser Thr lie Leu Lys Leu Val Phe Arg Phe Tyr Glu Pro Glu Gin 
260 265 270 



873 



GGT CGT ATC CTA GTT GGC GGC ACA GAT ATC CGC GAT TTA GAC TTG CTT 
Gly Arg lie Leu Val Gly Gly Thr Asp lie Arg Asp Leu Asp Leu Leu 
275 280 285 



921 



TCT TTA CGG AAG GCT ATC GGT GTC GTG CCC CAA GAT ACT CCT CTC TTC 
Ser Leu Arg Lys Ala lie Gly Val Val Pro Gin Asp Thr Pro Leu Phe 
290 295 300 305 



969 



AAT GAC ACA ATC TGG GAG AAT GTT AAA TTC GGC AAT ATC AGT TCC TCT 1017 
Asn Asp Thr lie Trp Glu Asn Val Lys Phe Gly Asn lie Ser Ser Ser 
310 ~ 315 320 

GAC GAT GAG ATT CTC AGG GCC ATA GAA AAA GCT CAA CTC ACG AAG CTA 1065 
Asp Asp Glu lie Leu Arg Ala lie Glu Lys Ala Gin Leu Thr Lys Leu 
325 ' 330 335 

CTC CAG AAC CTA CCA AAG GGC GCT TCC ACC GTT GTA GGG GAG CGC GGT 1113 
Leu Gin Asn Leu Pro Lys Gly Ala Ser Thr Val Val Gly Glu Arg Gly 
340 345 350 

TTG ATG ATC AGC GGA GGT GAG AAA CAA AGG CTT GCT ATT GCT CGT GTG 1161 
Leu Met lie Ser Gly Gly Glu Lys Gin Arg Leu Ala lie Ala Arg Val 
355 ~ 360 365 

CTT TTG AAG GAC GCT CCG CTG ATG TTT TTC GAC GAG GCT ACA AGT GCT 1209 
Leu Leu Lys Asp Ala Pro Leu Met Phe Phe Asp Glu Ala Thr Ser Ala 
370 ' 375 380 385 

CTG GAT ACA CAC ACA GAG CAG GCA CTC TTG CAC ACC ATT CAG CAG AAC 1257 
Leu Asp Thr His Thr Glu Gin Ala Leu Leu His Thr lie Gin Gin Asn 
390 395 400 

TTT TCT TCC AAT TCA AAG ACG AGC GTT TAC GTT G,CC CAT AGA CTG CGC 1305 
Phe Ser Ser Asn Ser Lys Thr Ser Val Tyr Val Ala His Arg Leu Arg 
405 410 415 

ACA ATC GCT GAT GCA GAT AAG ATC ATT GTT CTT GAA CAA GGT TCT GTC 1353 
Thr lie Ala Asp Ala Asp Lys lie lie Val Leu Glu Gin Gly Ser Val 
420 425 430 

CGC GAA GAG GGC ACA CAC AGC TCG CTG TTA GCG TCA CAA GGA TCC CTA 14 01 

Arg Glu Glu Gly Thr His Ser Ser Leu Leu Ala Ser Gin Gly Ser Leu 
435 440 445 

TAC CGG GGT CTG TGG GAT ATT CAG GAA AAC CTA ACG CTT CCG GAA CGG 14 4 9 

Tyr Arg Gly Leu Trp Asp lie Gin Glu Asn Leu Thr Leu Pro Glu Arg 
450 ~ 455 460 465 

CCT GAG CAG TCA ACC GGA TCT CAG CAT GCA TAGACGTCTG ACTAGAGATT 14 99 

Pro Glu Gin Ser Thr Gly Ser Gin His Ala 
470 475 

ATATAATAAC CCTCGAGCCA AAATTATACG GCGCTAACAA GTAAAAATTT TAGTTACTTT 155 9 

TCTGACTTCT CTACGCTGAC TTCTCTACCC T TCT AAC AT A GTTAATTGAA GTAGTGGTTA 1619 

AT GAC GAC TG CATTTTATTA TTGTCCACTT TGCATTAGAA GTACTAGTGC TTAAGCGCTC 167 9 

TTTAGGCCGC TTTCTTCTTC TTTGTCAGGC CGCAAGGTAA AGG AAG CAC C AACGGATTGC 1739 

TACCGCTGCT ATTCCTGCTC TCTCAAG ATG TGT GGC ATA TTA GGC GTT GTG 17 90 

Met Cys Gly He Leu Gly Val Val 
1 5 

CTA GCC GAT CAG TCG AAG GTG GTC GCC CCT GAG TTG TTT GAT GGC TCA 1838 
Leu Ala Asp Gin Ser Lys Val Val Ala Pro Glu Leu Phe Asp Gly Ser 
10 15 20 



CTG TTC TTA CAG CAT CGC GGT CAA GAT GCT GCC GGG ATT GCT ACG TGC 188 6 

Leu Phe Leu Gin His Arg Gly Gin Asp Ala Ala Gly lie Ala Thr Cys 
25 30 35 40 

GGC CCC GGT GGG CGC TTG TAC CAA TGT AAG GGC AAT GGT ATG GCA CGG 1934 
Gly Pro Gly Gly Arg Leu Tyr Gin Cys Lys Gly Asn Gly Met Ala Arg 
45 50 55 

GAC GTG TTC ACG CAA GCT CGG ATG TCA GGG TTG GTT GGC TCT ATG GGG 1982 
Asp Val Phe Thr Gin Ala Arg Met Ser Gly Leu Val Gly Ser Met Gly 
60 65 70 

ATT GCA CAC CTG AGA TAT CCC ACT GCA GGC TCC AGT GCG AAC TCA GAA 2030 
lie Ala His Leu Arg Tyr Pro Thr Ala Gly Ser Ser Ala Asn Ser Glu 
75 80 85 

GCG CAG CCA TTC TAT GTG AAT AGT CCC TAC GGA ATT TGC ATG AGT CAT 2078 
Ala Gin Pro Phe Tyr Val Asn Ser Pro Tyr Gly lie Cys Met Ser His 
90 95 100 

AAT GGT AAT CTG GTG AAC ACG ATG TCT CTA CGT AGA TAT CTT GAT GAA 2126 
Asn Gly Asn Leu Val Asn Thr Met Ser Leu Arg Arg Tyr Leu Asp Glu 
105 ~ 110 115 ~ 1 120 

GAC GTT CAC CGT CAT ATT AAC ACG GAC AGC GAT TCT GAG CTA CTG CTT 2174 
Asp Val His Arg His lie Asn Thr Asp Ser Asp Ser Glu Leu Leu Leu 
125 130 135 

AAT ATA TTT GCC GCG GAG CTG GAA AAG TAC AAC AAA TAT CGT GTG AAC 2222 
Asn lie Phe Ala Ala Glu Leu Glu Lys Tyr Asn Lys Tyr Arg Val Asn 
140 145 150 

AAC GAT GAT ATA TTT TGT GCT CTA GAG GGT GTT TAC AAA CGT TGT CGC 2270 
Asn Asp Asp lie Phe Cys Ala Leu Glu Gly Val Tyr Lys Arg Cys Arg 
155 160 165 

GGT GGC TAT GCT TGT GTT GGC ATG TTG GCG GGA TAT GGA TTG TTT GGT 2318 
Gly Gly Tyr Ala Cys Val Gly Met Leu Ala Gly Tyr Gly Leu Phe Gly 
170 175 180 

TTC CGG GAC CCC AAT GGG ATC AGG CCG CTA TTG TTT GGT GAG CGC GTC 2366 
Phe Arg Asp Pro Asn Gly lie Arg Pro Leu Leu Phe Gly Glu Arg Val 
185 190 195 200 

AAC GAT GAC GGC ACC ATG GAC TAC ATG CTA GCG TCC GAA AGT GTC GTT 2414 
Asn Asp Asp Gly Thr Met Asp Tyr Met Leu Ala Ser Glu Ser Val Val 
205 210 215 

CTT AAG GCC CAC CGC TTC CAA AAC ATA CGT GAT ATT CTT CCC GGC CAA 24 62 

Leu Lys Ala His Arg Phe Gin Asn lie Arg Asp lie Leu Pro Gly Gin 
220 " 225 230 

GCC GTC ATT ATC CCT AAA ACG TGC GGC TCC AGT CCA CCA GAG TTC CGG 2510 
Ala Val lie lie Pro Lys Thr Cys Gly Ser Ser Pro Pro Glu Phe Arg 
235 240 245 

CAG GTA GTG CCA ATT GAG GCC TAC AAA CCG GAC TTG TTT GAG TAC GTG 2558 
Gin Val Val Pro lie Glu Ala Tyr Lys Pro Asp Leu Phe Glu Tyr Val 
250 255 260 



TAT TTC GCT CGT GCT GAC AGC GTT CTG GAC GGT ATT TCC GTT TAC CAT 2 606 

Tyr Phe Ala Arg Ala Asp Ser Val Leu Asp Gly lie Ser Val Tyr His 
265 270 275 280 

ACA CGC CTG TTG ATG GGT ATC AAA CTT GCC GAG AAC ATC AAA AAA CAG 2654 
Thr Arg Leu Leu Met Gly lie Lys Leu Ala Glu Asn lie Lys Lys Gin 
285 "* 290 295 

ATC GAT CTG GAC GAA ATT GAC GTT GTT GTA TCT GTT CCT GAC ACT GCA 27 02 

lie Asp Leu Asp Glu lie Asp Val Val Val Ser Val Pro Asp Thr Ala 
300 305 310 

CGT ACC TGT GCA TTG GAG TGT GCC AAC CAT TTA AAC AAA CCT TAT CGC 27 50 

Arg Thr Cys Ala Leu Glu Cys Ala Asn His Leu Asn Lys Pro Tyr Arg 
315 320 325 

GAA GGA TTT GTC AAG AAC AGA TAT GTT GGA AGA ACA TTT ATC ATG CCA 27 98 

Glu Gly Phe Val Lys Asn Arg Tyr Val Gly Arg Thr Phe lie Met Pro 
330 335 340 

AAC CAA AAA GAG CGA GTA TCT TCT GTG CGC CGC AAG TTG AAC CCA ATG 284 6 

Asn Gin Lys Glu Arg Val Ser Ser Val Arg Arg Lys Leu Asn Pro Met 
345 350 355 360 

AAC TCA GAA TTT AAA GAC AAG CGC GTG CTG ATT GTC GAT GAT TCC ATT 28 94 

Asn Ser Glu Phe Lys Asp Lys Arg Val Leu lie Val Asp Asp Ser lie 
365 ~ 370 375 

GTG CGA GGT ACC ACT TCC AAA GAG ATT GTT AAC ATG GCG AAG GAA TCC 2 942 

Val Arg Gly Thr Thr Ser Lys Glu lie Val Asn Met Ala Lys Glu Ser 
380 385 390 

GGT GCT GCC AAG GTC TAC TTT GCC TCT GCA GCG CCA GCA ATT CGT TTC 2 990 

Gly Ala Ala Lys Val Tyr Phe Ala Ser Ala Ala Pro Ala lie Arg Phe 
395 400 405 

AAT CAC ATC TAC GGG ATT GAC CTA GCA GAT ACT AAG CAG CTT GTC GCC 3038 
Asn His lie Tyr Gly lie Asp Leu Ala Asp Thr Lys Gin Leu Val Ala 
410 ~ 415 420 

TAC AAC AGA ACT GTT GAA GAA ATC ACT GCG GAG CTG GGC TGT GAC CGC 308 6 

Tyr Asn Arg Thr Val Glu Glu lie Thr Ala Glu Leu Gly Cys Asp Arg 
425 430 435 440 

GTC ATC TAT CAA TCT TTG GAT GAC CTC ATC GAC TGT TGC AAG ACA GAC 3134 
Val lie Tyr Gin Ser Leu Asp Asp Leu lie Asp Cys Cys Lys Thr Asp 
445 450 455 

ATC ATC TCA GAA TTT GAA GTT GGA GTT TTC ACT GGT AAC TAC GTT ACA 3182 
He He Ser Glu Phe Glu Val Gly Val Phe Thr Gly Asn Tyr Val Thr 
460 465 470 

GGT GTT GAG GAT GTG TAC TTG CAG GAA TTA GAA CGT TGC CGC GCT CTT 3230 
Gly Val Glu Asp Val Tyr Leu Gin Glu Leu Glu Arg Cys Arg Ala Leu 
475 480 485 

AAT AAC TCG AAT AAG GGT GAA GCG AAG GCC GAG GTT GAT ATT GGT CTC 3278 
Asn Asn Ser Asn Lys Gly Glu Ala Lys Ala Glu Val Asp He Gly Leu 
490 495 500 



TAC AAT TCT GCC GAC TAT TAGCGGCGCC GTTGCCGGCA TCCGGCCCCA 3326 
Tyr Asn Ser Ala Asp Tyr 
505 ~ 510 

TATATAGACT CATCGGGACC TAAAATAAGC CTTTACAGAT CAT TAT C TAC AAATATAGAT 3386 

ACCATTAAAA GCCTGACTTT CGACTTACTC CTAGCACACC CCGTTGTATC CCTGTGCTTG 34 4 6 

CTTTCTTAAA TGCCGTTGGT TAGGCTTTGG ACTTAGCGTC CCGCCCATTT TCTAGCATGT 3506 

GCAGATCTAG CAAATTTGGC CTAAGACAAG AAGATCCATT CGGCACCCAC ATCCTGGAGC 3566 

CAGCACACAG TGGACCCAGA C ATG AGC AGC GGC AAT ATA TGG AAG CAA TTG 3617 

Met Ser Ser Gly Asn lie Trp Lys Gin Leu 
1 5 10 

CTA GAG GAG AAT AGC GAA CAG CTG GAC CAG TCC ACT ACG GAG ACT TAC 3665 
Leu Glu Glu Asn Ser Glu Gin Leu Asp Gin Ser Thr Thr Glu Thr Tyr 
15 20 25 

GTG GTA TGC TGC GAG AAC GAA GAT TCC CTT AAC CAG TTT TTG CAA CAA 3713 
Val Val Cys Cys Glu Asn Glu Asp Ser Leu Asn Gin Phe Leu Gin Gin 
30 35 40 

TGT TGG CAG ATT GAC GAG GGC GAG AAG GTG ACC AAC CTG GAG CCG TTG 37 61 

Cys Trp Gin lie Asp Glu Gly Glu Lys Val Thr Asn Leu Glu Pro Leu 
45 50 55 

GGA TTC TTT ACA AAG GTG GTT TCG CGC GAC GAA GAG AAC CTC CGG CTC 3809 
Gly Phe Phe Thr Lys Val Val Ser Arg Asp Glu Glu Asn Leu Arg Leu 
60 65 70 

AAC GTA TAC TAT GCC AAG AGC CCA CTG GAT GCA CAG ACG CTG CAG TTT 3857 
Asn Val Tyr Tyr Ala Lys Ser Pro Leu Asp Ala Gin Thr Leu Gin Phe 
75 " 80 85 90 

CTG GGC GTG TTC CTG CGC CAA ATG GAA ACC TCA CAA ATA CGT TGG ATC 3905 
Leu Gly Val Phe Leu Arg Gin Met Glu Thr Ser Gin lie Arg Trp lie 
95 100 105 

TTC CTA CTG GAC TGG CTG CTA GAC GAT AAA CGA TTA TGG CTA CGT CAA 3953 
Phe Leu Leu Asp Trp Leu Leu Asp Asp Lys Arg Leu Trp Leu Arg Gin 
110 ~ 115 120 

CTG CGG AAC TCG TGG GCC GCC TTG GAG GAA GCG CAG GTG GCA CCC TTT 4001 
Leu Arg Asn Ser Trp Ala Ala Leu Glu Glu Ala Gin Val Ala Pro Phe 
125 130 135 

CCA GGT GGC GCT GTG GTG GTG GTC CTC AAC CCG AGT CAC GTG ACA CAA 404 9 

Pro Gly Gly Ala Val Val Val Val Leu Asn Pro Ser His Val Thr Gin 
140 145 150 

CTG GAG CGA AAC ACG ATG GTT TGG AAC TCC CGC CGT CTG GAC CTG GTA 4097 
Leu Glu Arg Asn Thr Met Val Trp Asn Ser Arg Arg Leu Asp Leu Val 
155 160 165 170 



CAC CAG ACA CTG CGA GCT GCA TGC CTC AAC ACC GGC TCG GCG CTA GTT 
His Gin Thr Leu Arg Ala Ala Cys Leu Asn Thr Gly Ser Ala Leu Val 
175 ~ 180 185 



4145 



ACA CTT GAT CCT AAT ACT GCG CGC GAA GAC GTC ATG CAC ATA TGT GCG 4193 
Thr Leu Asp Pro Asn Thr Ala Arg Glu Asp Val Met His lie Cys Ala 
190 195 200 

CTG CTT GCG GGG CTG CCT ACA TCC CGT CCC GTC GCG ATG CTA AGC CTG 4241 
Leu Leu Ala Gly Leu Pro Thr Ser Arg Pro Val Ala Met Leu Ser Leu 
205 210 215 

CAA AGT CTA TTC ATC CCC CAC GGT GCA GAT TCC ATC GGC AAG ATC TGC 4 289 

Gin Ser Leu Phe lie Pro His Gly Ala Asp Ser lie Gly Lys lie Cys 
220 225 230 

ACC ATC GCG CCC GAG TTC CCT GTT GCT ACG GTG TTC GAC AAC GAT TTT 4 337 

Thr He Ala Pro Glu Phe Pro Val Ala Thr Val Phe Asp Asn Asp Phe 
235 240 245 250 

GTG AGC TCG ACA TTC GAG GCC GCA ATT GCT CCA GAA CTT ACT CCA GGA 4 385 

Val Ser Ser Thr Phe Glu Ala Ala He Ala Pro Glu Leu Thr Pro Gly 
255 260 265 

CCA CGT GTG CCA TCT GAC CAC CCA TGG CTA ACA GAG CCT ACC AAC CCC 4 433 

Pro Arg Val Pro Ser Asp His Pro Trp Leu Thr Glu Pro Thr Asn Pro 
270 275 280 

CCT TCG GAG GCA ACC GCT TGG CAT TTC GAT CTC CAA GGT CGC CTC GCT 44 81 

Pro Ser Glu Ala Thr Ala Trp His Phe Asp Leu Gin Gly Arg Leu Ala 
285 290 295 

ACC CTA TAC CGG CAT CTT GGT GAC TCT AAC AAG GCC ATA TCT GTT ACT 4 52 9 

Thr Leu Tyr Arg His Leu Gly Asp Ser Asn Lys Ala He Ser Val Thr 
300 305 310 

CAG CAC CGC TTC CAC AAG CCC CGC TCG GAA GAT TAT GCA TAC GAA TTC 4 577 

Gin His Arg Phe His Lys Pro Arg Ser Glu Asp Tyr Ala Tyr Glu Phe 
315 ~ 320 325 330 

GAG CTG CCG TCT AAG CAC CCT ACA ATA CGT GAC CTC ATA CGC TCT GCC 4 625 

Glu Leu Pro Ser Lys His Pro Thr He Arg Asp Leu He Arg Ser Ala 
335 340 345 

GCA GCC GAC TCA CCG AAC GAC GTC GCT GAC TCC ATC GAT GGG CTT ATG 4 673 

Ala Ala Asp Ser Pro Asn Asp Val Ala Asp Ser He Asp Gly Leu Met 
350 355 360 

GAT GGT ATC GTA CAA AGG AAT GTT CAT TGACGTCGAC ACAAAAATTT 4 720 

Asp Gly He Val Gin Arg Asn Val His 
365 370 

TGTTACTGTT CTCTCGAGAA CTATTCTCAT CCAGTACTGA CATATTAGAA GGCGAAGTGA 47 80 

ACTAGGATTT ATATAAAGTA GCCTTCAGGC AATTGCACAG GGTCTATTGA GTCGCTGCCG 4 840 

TTCACGAGAG AGCCCAATAT ATCGAGGACT AATTGGTCAC TTTTGTTTTG CTATACTCAC 4 900 

CCTGTATTTG CTAATCATTT ATCCGCTTTG TCCAAGTGGT TGCGAAGATA TCGAGCCAGA 4 960 

AC AT TAG AAT CTGGTTTGCC GCATCCTAGA GCTGTCTCCA AGCCAGTTGA ACCGTTGCGG 5020 



GAGATTACCG CAGCCGGTTT GAT CAG AGT A CTGGTGACTG CCAGCACCCA CGTTTGTGAC 5080 



TTATAAATAT ACGCCCTGTG GAGCCATAGC CATTGGCATA AAGAGAAGAG CACCCCGTGC 514 0 

CACGATGCAG ACACTTCCGG TGTACCCAGC GTCACAGACT GCGTCGCCTA CGAAGCGTGA 5200 

ACTTGCAGCG GCGCCCTCGG TGCCGCAGGA CGGCGCCCGG CTGCCTGCGC AGCTCACTTT 5260 

AGTGACGCCC CCAGAACCTG ATATCCAGAA GAAGTCAGTG CGATCTCAGG TCGCGCGTTT 5320 

AAGCATCTCG GAGACAGATG TAGTGAAGAG TGATATCGTG GCTAAGCTT 5369 



<210> 4 
<211> 475 
<212> PRT 

<213> Ashbya gossypii 
<400> 4 

Met Asp Arg Gly Cys Lys Gly lie Ser Tyr Val Leu Ser Ala Met Val 
15 10 15 

Phe His He He Pro He Thr Phe Glu He Ser Met Val Cys Gly He 
20 25 30 

Leu Thr Tyr Gin Phe Gly Ala Ser Phe Ala Ala He- Thr Phe Ser Thr 
35 ~ 40 45 

Met Leu Leu Tyr Ser He Phe Thr Phe Arg Thr Thr Ala Trp Arg Thr 
50 55 60 

Arg Phe Arg Arg Asp Ala Asn Lys Ala Asp Asn Lys Ala Ala Ser Val 
65 ' 70 75 80 

Ala Leu Asp Ser Leu He Asn Phe Glu Ala Val Lys Tyr Phe Asn Asn 
85 90 95 

Glu Lys Tyr Leu Ala Asp Lys Tyr His Thr Ser Leu Met Lys Tyr Arg 
100 105 110 

Asp Ser Gin He Lys Val Ser Gin Ser Leu Ala Phe Leu Asn Thr Gly 
115 " 120 125 

Gin Asn Leu He Phe Thr Thr Ala Leu Thr Ala Met Met Tyr Met Ala 
130 135 140 

Cys Asn Gly Val Met Gin Gly Ser Leu Thr Val Gly Asp Leu Val Leu 
145 150 155 160 

He Asn Gin Leu Val Phe Gin Leu Ser Val Pro Leu Asn Phe Leu Gly 
165 170 175 

Ser Val Tyr Arg Asp Leu Lys Gin Ser Leu He Asp Met Glu Ser Leu 
180 185 190 

Phe Lys Leu Gin Lys Asn Gin Val Thr He Lys Asn Ser Pro Asn Ala 
195 200 205 

Gin Asn Leu Pro lie His Lys Pro Leu Asp He Arg Phe Glu Asn Val 
210 215 ^ 220 



Thr Phe Gly Tyr Asp Pro Glu Arg Arg lie Leu Asn Asn Val Ser Phe 
225 ' 230 235 240 

Thr He Pro Ala Gly Met Lys Thr Ala He Val Gly Pro Ser Gly Ser 
245 250 255 

Gly Lys Ser Thr He Leu Lys Leu Val Phe Arg Phe Tyr Glu Pro Glu 
260 ~ 265 ~ 270 

Gin Gly Arg He Leu Val Gly Gly Thr Asp He Arg Asp Leu Asp Leu 
275 280 285 

Leu Ser Leu Arg Lys Ala He Gly Val Val Pro Gin Asp Thr Pro Leu 
290 295 300 

Phe Asn Asp Thr He Trp Glu Asn Val Lys Phe Gly Asn He Ser Ser 
305 ' 310 315 320 

Ser Asp Asp Glu He Leu Arg Ala He Glu Lys Ala Gin Leu Thr Lys 
325 330 335 

Leu Leu Gin Asn Leu Pro Lys Gly Ala Ser Thr Val Val Gly Glu Arg 
340 345 350 

Gly Leu Met He Ser Gly Gly Glu Lys Gin Arg Leu Ala He Ala Arg 
355 360 365 

Val Leu Leu Lys Asp Ala Pro Leu Met Phe Phe Asp Glu Ala Thr Ser 
370 " 375 380 

Ala Leu Asp Thr His Thr Glu Gin Ala Leu Leu His Thr He Gin Gin 
385 * 390 395 400 

Asn Phe Ser Ser Asn Ser Lys Thr Ser Val Tyr Val Ala His Arg Leu 
405 . 410 415 

Arg Thr He Ala Asp Ala Asp Lys He He Val Leu Glu Gin Gly Ser 
420 425 430 . 

Val Arg Glu Glu Gly Thr His Ser Ser Leu Leu Ala Ser Gin Gly Ser 
435 440 445 

Leu Tyr Arg Gly Leu Trp Asp He Gin Glu Asn Leu Thr Leu Pro Glu 
450 ~ 455 460 

Arg Pro Glu Gin Ser Thr Gly Ser Gin His Ala 
465 470 475 



<210> 5 
<211> 510 
<212> PRT 

<213> Ashbya gossypii 
<400> 5 

Met Cys Gly He Leu Gly Val Val Leu Ala Asp Gin Ser Lys Val Val 
1 5 10 15 

Ala Pro Glu Leu Phe Asp Gly Ser Leu Phe Leu Gin His Arg Gly Gin 



20 

Asp Ala Ala Gly 

35 

Cys Lys Gly Asn 
50 

Ser Gly Leu Val 
65 

Ala Gly Ser Ser 



Pro Tyr Gly lie 
100 

Ser Leu Arg Arg 
115 

Asp Ser Asp Ser 
130 

Lys Tyr Asn Lys 
145 

Glu Gly Val Tyr 



Leu Ala Gly Tyr 
180 

Pro Leu Leu Phe 
195 

Met Leu Ala Ser 
210 

lie Arg Asp lie 
225 

Gly Ser Ser Pro 



Lys Pro Asp Leu 
2 60 

Leu Asp Gly lie 
275 

Leu Ala Glu Asn 
290 

Val Val Ser Val 
305 

Asn His Leu Asn 



He Ala Thr 

Gly Met Ala 
55 

Gly Ser Met 
70 

Ala Asn Ser 
85 

Cys Met Ser 

Tyr Leu Asp 

Glu Leu Leu 
135 

Tyr Arg Val 
150 

Lys Arg Cys 
165 

Gly Leu Phe 

Gly Glu Arg 

Glu Ser Val 
215 

Leu Pro Gly 
230 

Pro Glu Phe 
245 

Phe Glu Tyr 
Ser Val Tyr 



25 

Cys Gly 
40 

Arg Asp 
Gly He 
Glu Ala 



His Asn 
105 

Glu Asp 
120 



Pro Gly 
Val Phe 



Ala His 
75 

Gin Pro 
90 



Gly Arg 
45 

Thr Gin 
60 

Leu Arg 
Phe Tyr 



Gly Asn Leu Val 
Val His 



Leu Asn He Phe 



Asn Asn 



Arg Gly 



Asp Asp 
155 

Gly Tyr 
170 



Arg His 
125 

Ala Ala 
140 

He Phe 



Ala Cys 



Gly Phe 
185 



Arg Asp Pro Asn 
Asp Asp 
Val Leu Lys Ala 



Val Asn 
200 



Gly Thr 
205 



30 

Leu Tyr Gin 
Ala Arg Met 



Tyr Pro Thr 
80 

Val Asn Ser 
95 

Asn Thr Met 
110 

He Asn Thr 



Glu Leu Glu 



Cys Ala Leu 
160 

Val Gly Met 
175 

Gly He Arg 
190 

Met Asp Tyr 



Gin Ala 



Arg Gin 



Val Tyr 
265 



Val He 
235 

Val Val 
250 



His Arg 
220 

He Pro 



Pro He 



Phe Ala Arg Ala 



His Thr 
280 



lie Lys Lys 
295 

Pro Asp Thr 
310 



Arg Leu 
Gin He Asp Leu 



Lys Pro Tyr 
325 



Ala Arg 
Arg Glu 



Thr Cys 
315 



Leu Met 
285 

Asp Glu 
300 

Ala Leu 



Phe Gin Asn 

Lys Thr Cys 
' 240 

Glu Ala Tyr 
255 

Asp Ser Val 
270 

Gly He Lys 
He Asp Val 



Glu Cys Ala 
320 



Gly Phe 
330 



Asn Arg Tyr 
335 



Val Lys 

Val Gly Arg Thr Phe He Met Pro Asn Gin Lys Glu Arg Val Ser Ser 



340 



345 



350 



Val Arg Arg Lys Leu Asn Pro Met Asn Ser Glu Phe Lys Asp Lys Arg 
355 ~ 360 365 

Val Leu lie Val Asp Asp Ser lie Val Arg Gly Thr Thr Ser Lys Glu 
370 375 380 

He Val Asn Met Ala Lys Glu Ser Gly Ala Ala Lys Val Tyr Phe Ala 
385 390 395 400 

Ser Ala Ala Pro Ala He Arg Phe Asn His He Tyr Gly He Asp Leu 
405 410 415 

Ala Asp Thr Lys Gin Leu Val Ala Tyr Asn Arg Thr Val Glu Glu He 
420 425 430 

Thr Ala Glu Leu Gly Cys Asp Arg Val He Tyr Gin Ser Leu Asp Asp 
435 ^ ^ 440 445 

Leu He Asp Cys Cys Lys Thr Asp He He Ser Glu Phe Glu Val Gly 
450 ~ " 455 460 

Val Phe Thr Gly Asn Tyr Val Thr Gly Val Glu Asp Val Tyr Leu Gin 
465 470 475 480 

Glu Leu Glu Arg Cys Arg Ala Leu Asn Asn Ser Asn Lys Gly Glu Ala 
485 490 495 

Lys Ala Glu Val Asp lie Gly Leu Tyr Asn Ser Ala Asp Tyr 
500 505 510 



<210> 6 
<211> 371 
<212> PRT 

<213> Ashbya gossypii 
<400> 6 

Met Ser Ser Gly Asn He Trp Lys Gin Leu Leu Glu Glu Asn Ser Glu 
15 10 15 

Gin Leu Asp Gin Ser Thr Thr Glu Thr Tyr Val Val Cys Cys Glu Asn 
20 25 30 

Glu Asp Ser Leu Asn Gin Phe Leu Gin Gin Cys Trp Gin He Asp Glu 
35 40 45 

Gly Glu Lys Val Thr Asn Leu Glu Pro Leu Gly Phe Phe Thr Lys Val 
50 55 60 

Val Ser Arg Asp Glu Glu Asn Leu Arg Leu Asn Val Tyr Tyr Ala Lys 
65 ~ 70 75 80 

Ser Pro Leu Asp Ala Gin Thr Leu Gin Phe Leu Gly Val Phe Leu Arg 
85 90 95 



Gin Met Glu Thr Ser Gin He Arg Trp He Phe Leu Leu Asp Trp Leu 
100 105 110 



Leu Asp Asp Lys Arg Leu 
115 



Ala Leu 
130 

Val Val 
145 

Val Trp 
Ala Cys 
Ala Arg 



Glu Glu Ala Gin 



Leu Asn Pro Ser 
150 

Asn Ser Arg Arg 
165 

Leu Asn Thr Gly 
180 

Glu Asp Val Met 
195 



Trp Leu Arg Gin Leu Arg Asn Ser Trp Ala 
120 125 

Val Ala Pro Phe Pro Gly Gly Ala Val Val 

135 140 

His Val Thr Gin Leu Glu Arg Asn Thr Met 
155 160 

Leu Asp Leu Val His Gin Thr Leu Arg Ala 
170 175 

Ser Ala Leu Val Thr Leu Asp Pro Asn Thr 
185 190 

His lie Cys Ala Leu Leu Ala Gly Leu Pro 
200 205 



Thr Ser 
210 

His Gly 
225 

Pro Val 
Ala Ala 
His Pro 



Arg Pro Val Ala 



Ala Asp Ser lie 
230 

Ala Thr Val Phe 
245 

He Ala Pro Glu 
260 

Trp Leu Thr Glu 
275 



Met Leu Ser Leu Gin Ser Leu Phe He Pro 
215 220 

Gly Lys He Cys Thr He Ala Pro Glu Phe 
235 240 

Asp Asn Asp Phe Val Ser Ser Thr Phe Glu 
250 255 

Leu Thr Pro Gly Pro Arg Val Pro Ser Asp 
265 270 

Pro Thr Asn Pro Pro Ser Glu Ala Thr Ala 
280 285 



Trp His 
290 



Phe Asp Leu Gin 



Gly Arg Leu Ala Thr Leu Tyr Arg His Leu 
295 " 300 



Gly Asp Ser Asn Lys Ala 
305 ~ 310 



He Ser Val Thr Gin His Arg Phe His Lys 
315 320 



Pro Arg Ser Glu Asp Tyr 
325 

Pro Thr He Arg Asp Leu 
340 

Asp Val Ala Asp Ser He 
355 



Ala Tyr Glu Phe Glu Leu Pro Ser Lys His 
330 335 

He Arg Ser Ala Ala Ala Asp Ser Pro Asn 
345 350 

Asp Gly Leu Met Asp Gly He Val Gin Arg 
360 365 



Asn Val 
370 



His 



<210> 7 
<211> 3616 
<212> DNA 

<213> Ashbya gossypii 
<400> 7 

GGGCCCGGTG CCAGCTCGCC AGGTGCGGAC TCGCGCTCGG GCTGTGGGCG CTCTACCTGC 



TGCTGCTCGG CAGCTGCCTG ACGCGCGCGT ACGAGCTGTC GGATCTCGAA AACCTGGAAT 120 

CCGATTACTA CAGCTACGTG CTGGATGTGA ACTTCGCGCT GCTGAGCGCC ATGAGCGCGA 180 

CCGGCCTCGC GATGGGCGCC GTGAGCGGCT CCCTCGGGAG CGCGCCGGTG CTCGCGCAGT 240 

GGCCGGCAGC GATCTGGGCC GTGCGCTTCC TGCGCGCCGC GGGCTATGTC GCGATAGTCC 300 

TAATCCTGCC GTTCCTGTCC GTCGTCGCAT TCCTGCAGCC GCTCTGCGAG CGCGCGCTGG 360 

CGCTGTTCCC GTTTGTGCGC GCGTGGGGCA TGGACGGCGT GTTCAACTTC CTGCTGCTCT 420 

CCGCCGTGCT CTGGACTGTA TTCCTGGCCG TTCGCCTGCT CCGCGCCGTC TACAGACTGC 4 80 

TGCGCTGGCT GGTCGGTCTT TTGGTCCGCC TGGCACGCCT GCTGCTGCGA GGCGCCCGTC 54 0 

GGACGCCTGC GGCGGCCCCC GAGGAGCCCG TCTAGCGTGC GCGCGTTCTA GGCCCCTGAC 600 

AGCTCCTACC TGGTGCTGGC CGCCGGTAGG GCTCGCATCG TGCGGCGCAG GCCCATTGCT 660 

TTTTGGCCCC CGCTGGATCA TCGTTTCTTT TACGTGAAAA GTTTGCAGCG ATGAGCTGCA 720 

GTATAAATAG GTTTTCTAGA TGCGCCAAAT CCCAGCTGGG TTTACCGGCG TCTGTTCGGG 780 

ATAGTTACTT GATGGATGGG TCAACTTGAG AGCTTGGGTT TAGTGTTGAC TCCTTCTCTT 84 0 

CATAGCACGC CGAACAAAGC GCA ATG ACT TAC AGA GAC GCA GCC ACG GCA 890 

Met Thr Tyr Arg Asp Ala Ala Thr Ala 
1 5 

CTG GAG CAC CTG GCG ACG TAC GCC GAG AAG GAC GGG CTG TCC GTG GAG 938 
Leu Glu His Leu Ala Thr Tyr Ala Glu Lys Asp Gly Leu Ser Val Glu 
10 15 ~ 20 25 

CAG TTG ATG GAC TCC AAG ACG CGG GGC GGG TTG ACG TAC AAC GAC TTC 98 6 

Gin Leu Met Asp Ser Lys Thr Arg Gly Gly Leu Thr Tyr Asn Asp Phe 
30 35 40 

CTG GTC TTG CCG GGC AAG ATC GAC TTC CCA TCG TCG GAG GTG GTG CTG 1034 
Leu Val Leu Pro Gly Lys lie Asp Phe Pro Ser Ser Glu Val Val Leu 
45 50 55 

TCG TCG CGC CTG ACC AAG AAG ATC ACC TTG AAC GCG CCG TTT GTG TCG 1082 
Ser Ser Arg Leu Thr Lys Lys lie Thr Leu Asn Ala Pro Phe Val Ser 
60 65 70 

TCG CCG ATG GAC ACG GTG ACG GAG GCC GAC ATG GCG ATC CAC ATG GCG 1130 
Ser Pro Met Asp Thr Val Thr Glu Ala Asp Met Ala lie His Met Ala 
75 80 85 

CTC CTG GGC GGC ATC GGG ATC ATC CAC CAC AAC TGC ACT GCG GAG GAG 117 8 

Leu Leu Gly Gly lie Gly lie lie His His Asn Cys Thr Ala Glu Glu 
90 95 100 105 

CAG GCG GAG ATG GTG CGC CGG GTC AAG AAG TAC GAA AAC GGG TTC ATC 122 6 

Gin Ala Glu Met Val Arg Arg Val Lys Lys Tyr Glu Asn Gly Phe lie 
110 * 115 " ^ 120 



AAC GCC CCC GTG GTC GTG GGG CCG GAC GCG ACG GTG GCG GAC GTG CGC 
Asn Ala Pro Val Val Val Gly Pro Asp Ala Thr Val Ala Asp Val Arg 



1274 



125 



130 



135 



CGG ATG AAG AAC GAG TTT GGG TTT GCA GGA TTT CCT GTG ACA 1316 
Arg Met Lys Asn Glu Phe Gly Phe Ala Gly Phe Pro Val Thr 
140 145 150 

GGTATGTTAG AGTGGCACGC GGGGCTGCAC GCTGGGATGA TGATCATAAA TCAATAACTT 137 6 

TCGTTCTACT GACTGCGATC AAACGATCGT GTAGACACCT TTTACTCTGA CCGCAGACGT 14 36 

GCAGCGCCTT TTTGGCAGGA ACATGTACTA ACACATCAGC A GAT GAT GGC AAG 14 8 9 

Asp Asp Gly Lys 
1 

CCG ACC GGG AAG CTG CAG GGG ATC ATC ACG TCC CGT GAC ATC CAG TTT 1537 
Pro Thr Gly Lys Leu Gin Gly lie lie Thr Ser Arg Asp lie Gin Phe 
5 10 15 20 

GTC GAG GAC GAG ACC CTG CTT GTG TCT GAG ATC ATG ACC AAG GAC GTC 1585 
Val Glu Asp Glu Thr Leu Leu Val Ser Glu lie Met Thr Lys Asp Val 
25 30 35 

ATC ACT GGG AAG CAG GGC ATC AAC CTC GAG GAG GCG AAC CAG ATC CTG 1633 
lie Thr Gly Lys Gin Gly lie Asn Leu Glu Glu Ala Asn Gin lie Leu 
40 45 50 

AAG AAC ACC AAG AAG GGC AAG CTG CCA ATT GTG GAC GAG GCG GGC TGC 1681 
Lys Asn Thr Lys Lys Gly Lys Leu Pro lie Val Asp Glu Ala Gly Cys 
55 ' " 60 65 

CTG GTG TCC ATG CTT TCG AGA ACT GAC TTG ATG AAG AAC CAG TCC TAC 172 9 

Leu Val Ser Met Leu Ser Arg Thr Asp Leu Met Lys Asn Gin Ser Tyr 
70 75 80 

CCA TTG GCC TCC AAG TCT GCC GAC ACC AAG CAG CTG CTC TGT GGT GCT 1777 
Pro Leu Ala Ser Lys Ser Ala Asp Thr Lys Gin Leu Leu Cys Gly Ala 
85 " 90 ^ 95 100 

GCG ATC GGC ACC ATC GAC GCG GAC AGG CAG AGA CTG GCG ATG CTG GTC 1825 
Ala lie Gly Thr lie Asp Ala Asp Arg Gin Arg Leu Ala Met Leu Val 
105 110 115 

GAG GCC GGT CTG GAC GTT GTT GTG CTA GAC TCC TCG CAG GGT AAC TCG 1873 
Glu Ala Gly Leu Asp Val Val Val Leu Asp Ser Ser Gin Gly Asn Ser 
120 125 130 

GTC TTC CAG ATC AAC ATG ATC AAG TGG ATC AAG GAG ACC TTC CCA GAC 1921 
Val Phe Gin lie Asn Met lie Lys Trp lie Lys Glu Thr Phe Pro Asp 
135 140 145 

CTG CAG GTC ATT GCT GGC AAC GTG GTC ACC AGA GAG CAG GCT GCC AGC 1969 
Leu Gin Val He Ala Gly Asn Val Val Thr Arg Glu Gin Ala Ala Ser 
150 155 160 

TTG ATC CAC GCC GGC GCA GAC GGG TTG CGT ATC GGT ATG GGC TCT GGC 2017 
Leu He His Ala Gly Ala Asp Gly Leu Arg He Gly Met Gly Ser Gly 
165 170 175 180 



TCC ATC TGT ATC ACT CAG GAG GTG ATG GCC TGT GGT AGA CCA CAG GGT 
Ser He Cys He Thr Gin Glu Val Met Ala Cys Gly Arg Pro Gin Gly 



2065 



185 



190 



195 



ACC GCT GTC TAC AAC GTC ACG CAG TTC GCC AAC CAG TTT GGT GTG CCA 2113 
Thr Ala Val Tyr Asn Val Thr Gin Phe Ala Asn Gin Phe Gly Val Pro 
200 205 210 

TGT ATT GCT GAC GGT GGT GTC CAG AAC ATC GGG CAC ATT ACC AAA GCT 2161 
Cys lie Ala Asp Gly Gly Val Gin Asn lie Gly His lie Thr Lys Ala 
215 220 225 

ATC GCT CTT GGC GCG TCC ACC GTC ATG ATG GGC GGT ATG CTG GCA GGC 2209 
He Ala Leu Gly Ala Ser Thr Val Met Met Gly Gly Met Leu Ala Gly 
230 235 240 

ACT ACA GAG TCT CCA GGC GAG TAC TTC TTC AGG GAC GGG AAG AGA CTG 2257 
Thr Thr Glu Ser Pro Gly Glu Tyr Phe Phe Arg Asp Gly Lys Arg Leu 
245 250 255 260 

AAG ACC TAC AGA GGT ATG GGC TCC ATC GAC GCC ATG CAA AAG ACT GAT 2305 
Lys Thr Tyr Arg Gly Met Gly Ser He Asp Ala Met Gin Lys Thr Asp 
265 270 275 

GTC AAG GGT AAC GCC GCT ACC TCC CGT TAC TTC TCT GAG TCT GAC AAG 2353 
Val Lys Gly Asn Ala Ala Thr Ser Arg Tyr Phe Ser Glu Ser Asp Lys 
280 285 290 

GTT CTG GTC GCT CAG GGT GTT ACT GGT TCT GTG ATC GAC AAG GGC TCC 2401 
Val Leu Val Ala Gin Gly Val Thr Gly Ser Val He Asp Lys Gly Ser 
295 300 305 

ATC AAG AAG TAC ATT CCA TAT CTG TAC AAT 'GGT CTA CAG CAC TCG TGC 24 4 9 

He Lys Lys Tyr He Pro Tyr Leu Tyr Asn Gly Leu Gin His Ser Cys 
310 " " 315 320 

CAG GAT ATC GGT GTG CGC TCT CTA GTG GAG TTC AGA GAG AAG GTG GAC 24 97 

Gin Asp He Gly Val Arg Ser Leu Val Glu Phe Arg Glu Lys Val Asp 
325 330 335 340 

TCT GGC TCG GTC AGA TTT GAG TTC AGA ACT CCA TCT GCC CAG TTG GAG 2545 
Ser Gly Ser Val Arg Phe Glu Phe Arg Thr Pro Ser Ala Gin Leu Glu 
345 350 355 

GGT GGT GTG CAC AAC TTG CAC TCC TAC GAG AAG CGC CTA TTT GACTGAGTGC 2597 
Gly Gly Val His Asn Leu His Ser Tyr Glu Lys Arg Leu Phe Asp 
360 365 370 

CACTAGGCCC ACACTATAGA AGTGGATCCG GGCGCGATGG CACCCATACT TTTATATTAT 2 657 

GTTGATTGAT GTACGTAAAC GATAGATATA ATAACAGACG CGGCATCTCA TTTGTATGCA 2717 

ATATATCTGG AACATGGTTA TGCGTACTCA ACTGTATGTA CTACTTTATA TACACAGCTC 2777 

TGGGACACTT GGTGAGATAT ATGTTTCATT ATGTATGCCT CGCTATCGAA AGGTCTGGCA 2837 

TTATGGGCTA CTGGGTCTAA GAGTCATGGC TTATGAGTAT TTATTTATTT ATTTCTCTTC 2897 

CTTTTCATTA AACTCCTCGA GCTTCTTTCT GTAATACTGC TCTCTAGACT TCTCCACATC 2 957 



TGCTAATGAT GGTGGAAGTC GTTCGTTTTC CAAATCCGCT CTACGAGCGC GCTCGAAGTT 



3017 



AGACAGCGCC TCGTTCAGAC CTTCAGACCC GCGTGACAGC GCTCCACGAG GCAGCACGCC 3077 

AGAATTCATT GTTTTTAGGT ACTGCACCTT ATCGCTCTCT TCTCTCAACA CGCTATACAT 3137 

TCGGGAAACC TTGGCAATCG CCAATATTTT ACTGCGTAGT GCACGCCGTT TTGCATCATC 3197 

GTCCAGAATA GACCGTTTTT TCTTCGATTT CTTGGAGCCA GGTATAACAG TTACAACCTG 3257 

CTCAGTGTTT TTGGACTTCA ATGTAGCACC TAAGTCCTCC CTTATAACAA AAGTCTCTTC 3317 

CTCCAATTCT TCTTCAGTAC AAATGTTTAA TATCGAAACC AACATTTCAG TCACTTTCTC 3377 

GCCAACAAAT GGCAAAGACC AGGTGAATAC GTCCATGAAA TTCGGTAACC AATACGGATG 3437 

CTGTGACATG TTAAATTGTC TAATGTTCAT AACGTTATCC GAGTATTTTA GGACCGCGGC 34 97 

CTTGTTCTTG TAAGTGTCCA AGTAGTTGGG TGCGCTGAAC AACGTAAGTA AACTAGGAAA 3557 

GCCCAGATTC TTGGTATTCT TGTACATTCT GTAGCCCTGA TCTTGGGCTT CGTGGGCCC 3616 



<210> 8 
<211> 151 
<212> PRT 

<213> Ashbya gossypii 
<400> 8 

Met Thr Tyr Arg Asp Ala 
1 5 

Ala Glu Lys Asp Gly Leu 
20 

Arg Gly Gly Leu Thr Tyr 
35 

Asp Phe Pro Ser Ser Glu 
50 

lie Thr Leu Asn Ala Pro 
65 70 

Glu Ala Asp Met Ala lie 
85 

lie His His Asn Cys Thr 
100 

Val Lys Lys Tyr Glu Asn 
115 

Pro Asp Ala Thr Val Ala 
130 

Phe Ala Gly Phe Pro Val 
145 150 



Ala Thr Ala Leu Glu His Leu Ala Thr Tyr 
10 15 

Ser Val Glu Gin Leu Met Asp Ser Lys Thr 
25 30 

Asn Asp Phe Leu Val Leu Pro Gly Lys lie 
40 45 

Val Val Leu Ser Ser Arg Leu Thr Lys Lys 
55 60 

Phe Val Ser Ser Pro Met Asp Thr Val Thr 
75 80 

His Met Ala Leu Leu Gly Gly He Gly He 
90 95 

Ala Glu Glu Gin Ala Glu Met Val Arg Arg 
105 110 

Gly Phe He Asn Ala Pro Val Val Val Gly 
120 125 

Asp Val Arg Arg Met Lys Asn Glu Phe Gly 
135 140 



Thr 



<210> 9 



<211> 371 
<212> PRT 

<213> Ashbya gossypii 

\ 

<400> 9 

Asp Asp Gly Lys Pro Thr Gly Lys Leu Gin Gly lie lie Thr Ser Arg 
15 10 15 

Asp lie Gin Phe Val Glu Asp Glu Thr Leu Leu Val Ser Glu lie Met 
20 25 t 30 

Thr Lys Asp Val lie Thr Gly Lys Gin Gly lie Asn Leu Glu Glu Ala 
35 40 45 

Asn Gin lie Leu Lys Asn Thr Lys Lys Gly Lys Leu Pro lie Val Asp 
50 55 60 

Glu Ala Gly Cys Leu Val Ser Met Leu Ser Arg Thr Asp Leu Met Lys 
65 70 75 80 

Asn Gin Ser Tyr Pro Leu Ala Ser Lys Ser Ala Asp Thr Lys Gin Leu 
85 90 ~ 95 

Leu Cys Gly Ala Ala lie Gly Thr lie Asp Ala Asp Arg Gin Arg Leu 
100 ' 105 ' * 110 

Ala Met Leu Val Glu Ala Gly Leu Asp Val Val Val Leu Asp Ser Ser 
115 120 125 

Gin Gly Asn Ser Val Phe Gin lie Asn Met lie Lys Trp lie Lys Glu 
130 135 140 

Thr Phe Pro Asp Leu Gin Val lie Ala Gly Asn Val Val Thr Arg Glu 
145 ' 150 155 160 

Gin Ala Ala Ser Leu He His Ala Gly Ala Asp Gly Leu Arg lie Gly 
165 170 175 

Met Gly Ser Gly Ser He Cys He Thr Gin Glu Val Met Ala Cys Gly 
180 185 190 

Arg Pro Gin Gly Thr Ala Val Tyr Asn Val Thr Gin Phe Ala Asn Gin 
195 200 205 

Phe Gly Val Pro Cys He Ala Asp Gly Gly Val Gin Asn He Gly His 
210 215 220 

He Thr Lys Ala He Ala Leu Gly Ala Ser Thr Val Met Met Gly Gly 
225 230 235 240 

Met Leu Ala Gly Thr Thr Glu Ser Pro Gly Glu Tyr Phe Phe Arg Asp 
245 250 255 

Gly Lys Arg Leu Lys Thr Tyr Arg Gly Met Gly Ser He Asp Ala Met 
260 265 270 

Gin Lys Thr Asp Val Lys Gly Asn Ala Ala Thr Ser Arg Tyr Phe Ser 
275 280 285 



Glu Ser Asp Lys Val Leu Val Ala Gin Gly Val Thr Gly Ser Val He 
290 295 300 

Asp Lys Gly Ser He Lys Lys Tyr He Pro Tyr Leu Tyr Asn Gly Leu 
305 310 315 320 

Gin His Ser Cys Gin Asp He Gly Val Arg Ser Leu Val Glu Phe Arg 
325 330 335 

Glu Lys Val Asp Ser Gly Ser Val Arg Phe Glu Phe Arg Thr Pro Ser 
340 345 350 

Ala Gin Leu Glu Gly Gly Val His Asn Leu His Ser Tyr Glu Lys Arg 
355 360 365 

Leu Phe Asp 
370 

<210> 10 
<211> 2697 
<212> DNA 

<213> Ashbya gossypii 
<400> 10 

AT C GAT T TCA GGAGATTTTT GGTAGCATTA TTGAGGTCAT TAGAGGCGTT CTGTGACTTT 60 

CGACGATTTG CACGCGCAGA AGAGGGCGTT CAACCAGCCT TTCGGATATT CCGGTTCGAG 120 

TTATACCAGC AGGGATCAGC GCAGGCACTA GAGTGGCGGG TGCTAATAAG AGGAGCAGGT 180 

CCTGGAACTG AAGTTGCAAG AGATAAGCAT TGCGCGGAGA AGGAGGCGGT TAGAGGGTGC 24 0 

AAG C GAG C AG GATGGGGTCT TCGATGAACT TCCCGTCTGG GTATGTGAAC AAGCACACGC 300 

TGCAGGCACA CCGGTAGGGC GAGTGCAGGG TGAAAAATAT ATATGCGCTC GAGAAGCGCT 360 

GGGGATGAGT TCGTCTGCAA CGGCAGGCGG ATCTTCATCT GACAAAACCA GCTGCCTACA 420 

TCAGTGCGAA GCTGTTCAGT GATAGAATAG GAGTA ATG GCT GCT GTT GAA CAA 473 

Met Ala Ala Val Glu Gin 
1 5 

GTT TCT AGC GTG TTT GAC ACC ATT TTG GTG CTG GAC TTC GGG TCC CAG 521 
Val Ser Ser Val Phe Asp Thr He Leu Val Leu Asp Phe Gly Ser Gin 
10 15 20 

TAC TCG CAT CTG ATC ACG CGG CGG CTG CGT GAG TTT AAT GTG TAC GCG 569 
Tyr Ser His Leu He Thr Arg Arg Leu Arg Glu Phe Asn Val Tyr Ala 
25 30 35 

GAG ATG CTT CCG TGT ACG CAG AAG ATC AGC GAG CTG GGC TGG AAG CCA 617 
Glu Met Leu Pro Cys Thr Gin Lys He Ser Glu Leu Gly Trp Lys Pro' 
40 45 50 

AAG GGT GTG ATT TTG TCA GGC GGG CCG TAC TCC GTG TAC GCG GCA GAT 665 
Lys Gly Val lie Leu Ser Gly Gly Pro Tyr Ser Val Tyr Ala Ala Asp 
55 60 65 70 



GCT CCG CAC GTG GAC CGG GCG GTG TTC GAG TTG GGC GTT CCA ATT CTG 



713 



Ala Pro His Val 



Asp Arg Ala Val Phe Glu Leu Gly Val Pro lie Leu 
75 ' 80 85 



GGC ATC 
Gly He 



TGC TAC 
Cys Tyr 
90 



GGG CTA CAG 
Gly Leu Gin 



GAG CTT GCG TGG ATA GCC 
Glu Leu Ala Trp He Ala 
95 



GGC GCA GAG 
Gly Ala Glu 
100 



761 



GTG GGG CGC GGC GAG AAG CGC GAG TAC GGG CGC GCG ACG CTG CAC GTG 
Val Gly Arg Gly Glu Lys Arg Glu Tyr Gly Arg Ala Thr Leu His Val 
105 "* 110 " 115 



809 



GAG GAC 
Glu Asp 
120 



AGC GCG 
Ser Ala 



TGC CCG CTG 
Cys Pro Leu 
125 



TTC AAC AAC GTG 
Phe Asn Asn Val 



GAC AGC 
Asp Ser 
130 



AGC ACG GTG 
Ser Thr Val 



857 



TGG ATG TCG CAC GGT GAC AAG CTG CAC GCA CTA CCT GCG GAT TTC CAC 
Trp Met Ser His Gly Asp Lys Leu His Ala Leu Pro Ala Asp Phe His 
135 1 140 145 " 150 



905 



GTC ACT GCG ACG ACG GAG AAC TCT CCT TTC TGC GGG ATT GCA CAC GAC 
Val Thr Ala Thr Thr Glu Asn Ser Pro Phe Cys Gly He Ala His Asp 
155 160 165 



953 



TCG AAG 
Ser Lys 



CCA ATC 
Pro He 
170 



TTC GGG ATC 
Phe Gly He 



CAG TTC CAC CCT GAG GTG 
Gin Phe His Pro Glu Val 
175 



ACG CAC TCC 
Thr His Ser 
180 



1001 



TCG CAG GGG AAG ACG TTG CTG AAG AAC TTT GCG GTG GAG ATC TGC CAG 
Ser Gin Gly Lys Thr Leu Leu Lys Asn Phe Ala Val Glu He Cys Gin 
185 190 195 



1049 



GCC GCG 
Ala Ala 
200 



CAG ACC 
Gin Thr 



TGG ACG ATG 
Trp Thr Met 
205 



GAA AAC TTC ATT 
Glu Asn Phe He 



GAC ACC 
Asp Thr 
210 



GAG ATC CAG 
Glu He Gin 



1097 



CGG ATC CGG ACC CTT GTG GGC CCC ACC GCG GAA GTC ATC GGT GCT GTG 
Arg He Arg Thr Leu Val Gly Pro Thr Ala Glu Val He Gly Ala Val 
215 220 225 230 



1145 



TCC GGC 
Ser Gly 



GGT GTC 
Gly Val 



GAC TCG ACC 
Asp Ser Thr 
235 



GTC GCT GCG AAG CTG ATG 
Val Ala Ala Lys Leu Met 
240 



ACC GAG GCC 
Thr Glu Ala 
245 



1193 



ATC GGC GAC CGG TTC CAC GCG ATC CTG GTC GAC AAC GGT GTT CTG CGC 
He Gly Asp Arg Phe His Ala lie Leu Val Asp Asn Gly Val Leu Arg 
250 255 " 260 



1241 



CTC AAC GAA GCG GCC AAT GTG AAG AAA ATC CTC GGC GAG GGC TTG GGC 
Leu Asn Glu Ala Ala Asn Val Lys Lys He Leu Gly Glu Gly Leu Gly 
265 270 275 



1289 



ATC AAC 
He Asn 
280 



TTG ACT 
Leu Thr 



GTT GTT GAC 
Val Val Asp 
285 



GCC TCC GAA GAG 
Ala Ser Glu Glu 



TTC TTG 
Phe Leu 
290 



ACG AAG CTC 
Thr Lys Leu 



1337 



AAG GGC 
Lys Gly 
295 



GTC ACG 
Val Thr 



GAC CCT GAG 
Asp Pro Glu 
300 



AAG AAG AGA AAG ATC ATC 
Lys Lys Arg Lys He He 
305 



GGT AAC ACC 
Gly Asn Thr 
310 



1385 



TTC ATT CAT GTT TTT GAG CGC GAG GCA GCC AGG ATC CAG CCT AAG AAC 



1433 



Phe He His Val Phe Glu Arg Glu Ala Ala Arg He Gin Pro Lys Asn 
315 320 325 



GGC GAG GAG ATT GAG TTC CTG TTG CAG GGT ACC CTA TAC CCT GAC GTT 14 81 

Gly Glu Glu He Glu Phe Leu Leu Gin Gly Thr Leu Tyr Pro Asp Val 
330 335 340 

ATC GAG TCC ATT TCC TTT AAG GGC CCA TCT CAG ACG ATC AAG ACC CAC 1529 
He Glu Ser He Ser Phe Lys Gly Pro Ser Gin Thr He Lys Thr His 
345 350 355 

CAT AAC GTC GGT GGT CTT TTG GAC AAC ATG AAA CTG AAG CTC ATT GAG 1577 
His Asn Val Gly Gly Leu Leu Asp Asn Met Lys Leu Lys Leu He Glu 
360 ~ " 365 370 

CCT TTG CGC GAG CTT TTC AAG GAC GAG GTG AGA CAC CTG GGA GAA CTA 1625 
Pro Leu Arg Glu Leu Phe Lys Asp Glu Val Arg His Leu Gly Glu Leu 
375 " 380 " 385 390 

TTG GGG ATC TCC CAC GAG TTG GTC TGG AGA CAT CCG TTC CCA GGC CCA 1673 
Leu Gly He Ser His Glu Leu Val Trp Arg His Pro Phe Pro Gly Pro 
395 400 405 

GGT ATC GCC ATC CGT GTG CTA GGC GAG GTC ACC AAG GAG CAG GTG GAG 1721 
Gly He Ala He Arg Val Leu Gly Glu Val Thr Lys Glu Gin Val Glu 
410 415 420 

ATT GCC AGA AAG GCA GAC CAC ATC TAC ATC GAG GAG ATC AGG AAA GCA 17 69 

He Ala Arg Lys Ala Asp His He Tyr He Glu Glu He Arg Lys Ala 
425 . 430 435 

GGT CTA TAC AAC AAG ATT TCT CAA GCT TTT GCT TGC TTG CTG CCT GTT 1817 
Gly Leu Tyr Asn Lys lie Ser Gin Ala Phe Ala Cys Leu Leu Pro Val 
440 "* 445 450 

AAG TCT GTG GGT GTC ATG GGT GAC CAG AGA ACC TAC GAC CAG GTC ATT 1865 
Lys Ser Val Gly Val Met Gly Asp Gin Arg Thr Tyr Asp Gin Val He 
455 460 ' " 465 470 

GCT CTA AGA GCA ATT GAG ACC ACG GAC TTC ATG ACT GCC GAC TGG TAT 1913 
Ala Leu Arg Ala He Glu Thr Thr Asp Phe Met Thr Ala Asp Trp Tyr 
475 480 485 

CCA TTT GAG CAC GAA TTC TTG AAG CAT GTC GCA TCC CGT ATT GTT AAC 1961 
Pro Phe Glu His Glu Phe Leu Lys His Val Ala Ser Arg lie Val Asn 
490 495 500 

GAG GTT GAA GGT GTT GCC AGA GTC ACC. TAC GAC ATA ACT TCT AAG CCT 2009 
Glu Val Glu Gly Val Ala Arg Val Thr Tyr Asp He Thr Ser Lys Pro 
505 510 515 

CCA GCT ACC GTT GAA TGG GAA TAATCACCCT TGGGATCCGC TGACTGGCTA 2060 
Pro Ala Thr Val Glu Trp Glu 
520 525 

CTGTAATTCT ATGTAGTGGA T TAG TAC GAT AAGTTACTTT TGTATGATAG ATGTAATCAC 2120 

ATCTGGCTAT TAAAATGACT CAGCCGAGGT AAATCTAACG TCCCTTCACA AGGGTGTTCC 2180 

TGTGTGGACT TCCGCCTGAA T TTT TAT AGA TATATAGATA CTCTACTCAT G AAC AAC CTG 2240' 



PA APP^IA ATA 


rtu^n 1 1 rtu l o 


PPAP^APAAP 


APAAPpr;Tf^fi 


AAATftGGGPA 

ruin X \J \J VJ O Oxi. 


AGTA^AAAAA 


2300 


ATPATATTPP 


TTAAf^AATAA 


ftAPAftTAPPA 




ACGAGACGAT 


TTTTGAATCG 


2360 


AATGftPTTPP 


AGAPTPAPTT 


TGTAPPCATA 


ATAAPCCATG 


AACACCT GCC 


AGATGAGGTT 


2420 


PPPPPTPPAP 


TATPP^APHP 


PAATTAPATG 


AAAAftftTTGA 


ATTGTTTGGT 


GGTAAPPTCT 


2480 


CAGAGGACTG 


TGGAGTGTCT 


C TAT GAG G AC 


GTTCTGCCCT 


CTCTTCCAGC 


TGAAGCACGC 


2540 


AAATCTCTTC 


TCAATACGCC 


AGTATTCGTG 


GTTGGGCGTG 


CCACTCAGGA 


AT T TAT G GAG 


2600 


AGATGCGGCT 


TTACGGACGT 


GAGAGGGGGA 


TCTGAGACTG 


GTAATGGCGT 


TTTGCTAGCG 


2660 


GAGTTAATGT 


TAAATATGAT 


CCAGAAGGGC 


GATGGGG 






2697 



<210> 11 
<211> 525 
<212> PRT 

<213> Ashbya gossypii 
<400> 11 

Met Ala Ala Val Glu Gin Val Ser Ser Val Phe Asp Thr He Leu Val 
15 10 15 

Leu Asp Phe Gly Ser Gin Tyr Ser His Leu He Thr Arg Arg Leu Arg 
20 25 30 

Glu Phe Asn Val Tyr Ala Glu Met Leu Pro Cys Thr Gin Lys He Ser 
35 40 45 

Glu Leu Gly Trp Lys Pro Lys Gly Val He Leu Ser Gly Gly Pro Tyr 
50 ~ * 55 " 60 

Ser Val Tyr Ala Ala Asp Ala Pro His Val Asp Arg Ala Val Phe Glu 
65 70 75 80 

Leu Gly Val Pro He Leu Gly He Cys Tyr Gly Leu Gin Glu Leu Ala 
85 90 95 

Trp He Ala Gly Ala Glu Val Gly Arg Gly Glu Lys Arg Glu Tyr Gly 
100 105 110 

Arg Ala Thr Leu His Val Glu Asp Ser Ala Cys Pro Leu Phe Asn Asn 
115 120 125 

Val Asp Ser Ser Thr Val Trp Met Ser His Gly Asp Lys Leu His Ala 
130 135 140 

Leu Pro Ala Asp Phe His Val Thr Ala Thr Thr Glu Asn Ser Pro Phe 
145 150 155 160 

Cys Gly He Ala His Asp Ser Lys Pro He Phe Gly He Gin Phe His 
165 170 175 



Pro Glu Val Thr His Ser Ser Gin Gly Lys Thr Leu Leu Lys Asn Phe 
180 185 190 



Ala Val Glu He Cys Gin Ala Ala Gin Thr Trp Thr 
195 200 



Met Glu Asn Phe 
205 



He Asp Thr Glu He Gin Arg He Arg Thr Leu Val Gly Pro Thr Ala 
210 215 220 



Glu Val He Gly Ala Val Ser Gly Gly Val Asp Ser 
225 230 235 



Thr Val Ala Ala 
240 



Lys Leu Met Thr Glu Ala He Gly Asp Arg Phe His 
245 250 

Asp Asn Gly Val Leu Arg Leu Asn Glu Ala Ala Asn 
260 265 

Leu Gly Glu Gly Leu Gly lie Asn Leu Thr Val Val 
275 280 



Ala He Leu Val 
255 

Val Lys Lys He 
270 

Asp Ala Ser Glu 
285 



Glu Phe Leu Thr Lys Leu Lys Gly Val Thr Asp Pro 
290 295 300 

Lys He He Gly Asn Thr Phe He His Val Phe Glu 
305 310 315 



Glu Lys Lys Arg 



Arg Glu Ala Ala 
320 



Arg He Gin Pro Lys Asn Gly Glu Glu He Glu Phe 
325 330 



Leu Leu Gin Gly 
335 



Thr Leu Tyr Pro Asp Val He Glu Ser He Ser Phe 
340 345 



Lys Gly Pro Ser 
350 



Gin Thr He Lys Thr His His Asn Val Gly Gly Leu 
355 360 



Leu Asp Asn Met 
365 



Lys Leu Lys Leu He Glu Pro Leu Arg Glu Leu Phe 
370 375 380 

Arg His Leu Gly Glu Leu Leu Gly He Ser His Glu 
385 390 395 



Lys Asp Glu Val 



Leu Val Trp Arg 
400 



His Pro Phe Pro Gly Pro Gly He Ala He Arg Val 
405 410 



Leu Gly Glu Val 
415 



Thr Lys Glu Gin Val Glu He Ala Arg Lys Ala Asp 
420 425 



His He Tyr He 
430 



Glu Glu He Arg Lys Ala Gly Leu Tyr Asn Lys He 
435 440 



Ser Gin Ala Phe 
445 



Ala Cys Leu Leu Pro Val Lys Ser Val Gly Val Met Gly Asp Gin Arg 
450 455 460 



Thr Tyr Asp Gin Val He Ala Leu Arg Ala lie Glu 
4 65 470 ~ 475 



Thr Thr Asp Phe 
480 



Met Thr Ala Asp Trp Tyr Pro Phe Glu His Glu Phe 
485 490 



Leu Lys His Val 
495 



Ala Ser Arg He Val Asn Glu Val Glu Gly Val Ala Arg Val Thr Tyr 
500 505 510 



Asp lie Thr Ser Lys Pro Pro Ala Thr Val Glu Trp Glu 
515 520 525 



<210> 12 
<211> 1634 
<212> DNA 

<213> Ashbya gossypii 
<400> 12 

CCTCGAACAT CTATCTTCTG AGC TC GAT AG TCTACGAAAT CGGCACACTA GCCTAATTGC 60 

C GAG ATG AAG AGCTCCAGGG AACCGTTAAA GATCTGATGT TCCATCTTCA ATCAGGACAA 120 

ATGTTACGGG ATGTCCCTGA CGCCACAGAA GGTAGCCTGG TGGTCCAGAC AGAAAAAGAG 180 

CCTACACCAA AGAAGAAACA TAACAAGAAA AAGCCTCCGC ATCGTTTTGG TAAATCATAA 24 0 

TAGGCACGAT GCGCATATAC CCTGACCATC ATAGCGGTTC CCCCCGCTAA CTGCTCCGAG 300 

CGGGTAACCC CATGTCACAA AGTGACTCTG TCTCTTCGTG GTAGGTGATG TCAAATTTTC 360 

ACGACTTCCC ACCCCGATGA GCATCCGTAT TCCTTTTCAT CTAAATTCTA ATAGATGGCT 4 20 

TATGGATTCT TATTGGCGAC TTACAAGCCT ATGTAGTTGG CTTCCCTCAA GTGTTCGTAG 4 80 

TCTACCACCT CACACCCGGT CTAACAGCTT ACGAGAATA ATG GCT ACT AAT GCA 534 



Met Ala Thr Asn Ala 
1 5 



ATC AAG CTT CTT GCG CCA GAT ATC CAC AGG GGT CTG GCA GAG CTG GTC 
lie Lys Leu Leu Ala Pro Asp lie His Arg Gly Leu Ala Glu Leu Val 
10 ^ 15 20 



582 



GCT AAA CGC CTA GGC TTA CGT CTG ACA GAC TGC AAG CTT AAG CGG GAT 
Ala Lys Arg Leu Gly Leu Arg Leu Thr Asp Cys Lys Leu Lys Arg Asp 
25 ~ 30 35 



630 



TGT AAC GGG GAG GCG ACA TTT TCG ATC GGA GAA TCT GTT CGA GAC CAG 
Cys Asn Gly Glu Ala Thr Phe Ser lie Gly Glu Ser Val Arg Asp Gin 
40 45 50 



678 



GAT ATC TAC ATC ATC ACG CAG GTG GGG TCC GGG GAC GTG AAC GAC CGA 
Asp He Tyr He He Thr Gin Val Gly Ser Gly Asp Val Asn Asp Arg 
55 60 65 



726 



GTG CTG GAG CTG CTC ATC ATG ATC AAC GCT AGC AAG ACG GCG TCT GCG 
Val Leu Glu Leu Leu He Met He Asn Ala Ser Lys Thr Ala Ser Ala 
70 75 80 85 



774 



CGG CGA ATT ACG GCT GTG ATT CCA AAC TTC CCA TAC GCG CGG CAG GAC 
Arg Arg lie Thr Ala Val He Pro Asn Phe Pro Tyr Ala Arg Gin Asp 
90 95 100 



822 



CGG AAG GAT AAG TCA CGG GCG CCA ATT ACC GCG AAG CTC ATG GCG GAC 
Arg Lys Asp Lys Ser Arg Ala Pro He Thr Ala Lys Leu Met Ala Asp 
105 110 115 



870 



ATG CTG ACT ACC GCG GGC TGC GAT CAT GTC ATC ACC ATG GAC TTA CAC 
Met Leu Thr Thr Ala Gly Cys Asp His Val He Thr Met Asp Leu His 



918 



120 125 130 

GCT TCG CAA ATC CAG GGC TTC TTT GAT GTA CCA GTT GAC AAC CTT TAC 966 
Ala Ser Gin lie Gin Gly Phe Phe Asp Val Pro Val Asp Asn Leu Tyr 
135 140 145 

GCA GAG CCT AGC GTG GTG AAG TAT ATC AAG GAG CAT ATT CCC CAC GAC 1014 
Ala Glu Pro Ser Val Val Lys Tyr lie Lys Glu His lie Pro His Asp 
150 155 160 165 

GAT GCC ATC ATC ATC TCG CCG GAT GCT GGT GGT GCC AAA CGT GCG TCG 1062 
Asp Ala lie lie lie Ser Pro Asp Ala Gly Gly Ala Lys Arg Ala Ser 
170 175 180 

CTT CTA TCA GAT CGC CTA AAC TTG AAC TTT GCG CTG ATT CAT AAG GAA 1110 
Leu Leu Ser Asp Arg Leu Asn Leu Asn Phe Ala Leu lie His Lys Glu 
185 " 190 195 

CGT GCA AAG GCA AAC GAA GTG TCC CGC ATG GTT CTG GTC GGC GAT GTT 1158 
Arg Ala Lys Ala Asn Glu Val Ser Arg Met Val Leu Val Gly Asp Val 
200 205 210 

ACC GAT AAA GTC TGC ATT ATC GTT GAC GAT ATG GCG GAT ACT TGT GGT 1206 
Thr Asp Lys Val Cys lie lie Val Asp Asp Met Ala Asp Thr Cys Gly 
215 220 225 

ACG CTG GCC AAG GCG GCA GAA GTG CTG CTA GAG CAC AAC GCG CGG TCT 1254 
Thr Leu Ala Lys Ala Ala Glu Val Leu Leu Glu His Asn Ala Arg Ser 
230 235 240 245 

GTG ATA GCC ATT GTT ACC CAC GGT ATC CTT TCA GGA AAG GCC ATT GAG 1302 
Val He Ala He Val Thr His Gly He Leu Ser Gly Lys Ala He Glu 
250 255 260 

AAC ATC AAC AAT TCG AAG CTT GAT AGG GTT GTG TGT ACC AAC ACC GTG 1350 
Asn He Asn Asn Ser Lys Leu Asp Arg Val Val Cys Thr Asn Thr Val 
265 ~ ~ 270 " 275 

CCA TTC GAG GAG AAG ATG AAG TTA TGC CCG AAG TTA GAT GTA ATT GAT 1398 
Pro Phe Glu Glu Lys Met Lys Leu Cys Pro Lys Leu Asp Val lie Asp 
280 285 " 290 

ATC TCG GCA GTT CTT GCG GAA TCC ATT CGC CGT CTA CAC AAT GGT GAA 14 4 6 

He Ser Ala Val Leu Ala Glu Ser He Arg Arg Leu His Asn Gly Glu 
295 300 305 

AGT ATC TCC TAC CTC TTT AAA AAC AAC CCA CTA TGATTTTGCT TCTCGATGCT 14 99 
Ser He Ser Tyr Leu Phe Lys Asn Asn Pro Leu 
310 315 320 

GGCTTCTTGA GGGCCAATTT TGCCGTAGAG GTAGTATCCC TTCTTTTTAT ATTGACTATT 1559 

TAACGAAGAC TATTTCTTCA TAAATGGACT TCGGCTTCAC TGTGAATCTC ACATGATATA 1619 

GTTGTTTCAG AG ACC 1634 



<210> 13 
<211> 320 
<212> PRT 



<213> Ashbya gossypii 
<400> 13 



Met Ala 
1 



Thr Asn Ala lie 
5 



Lys Leu Leu Ala Pro Asp lie His Arg Gly 
10 15 



Leu Ala Glu Leu Val Ala 
20 



Lys Arg Leu Gly Leu Arg Leu Thr Asp Cys 
25 " " 30 



Lys Leu Lys Arg Asp Cys 
35 



Asn Gly Glu Ala Thr Phe Ser lie Gly Glu 
40 45 



Ser Val Arg Asp Gin Asp lie Tyr lie lie Thr Gin Val Gly Ser Gly 
50 55 60 



Asp Val 
65 



Asn Asp Arg Val 
70 



Leu Glu Leu Leu lie Met lie Asn Ala Ser 
75 80 



Lys Thr 
Tyr Ala 
Lys Leu 



Ala Ser Ala Arg 
85 

Arg Gin Asp Arg 
100 

Met Ala Asp Met 
115 



Arg lie Thr Ala Val lie Pro Asn Phe Pro 
90 95 

Lys Asp Lys Ser Arg Ala Pro lie Thr Ala 
105 ~ 110 

Leu Thr Thr Ala Gly Cys Asp His Val lie 
120 125 



Thr Met Asp Leu His Ala Ser Gin lie Gin Gly Phe Phe Asp Val Pro 
130 135 140 



Val Asp 
145 



Asn Leu Tyr Ala 
150 



Glu Pro Ser Val Val Lys Tyr lie Lys Glu 
155 160 



His lie Pro His Asp Asp 
165 



Ala lie lie lie Ser Pro Asp Ala Gly Gly 
170 175 



Ala Lys Arg Ala Ser Leu 
180 



Leu Ser Asp Arg Leu Asn Leu Asn Phe Ala 
185 " 190 



Leu lie His Lys Glu Arg 
195 



Ala Lys Ala Asn Glu Val Ser Arg Met Val 
200 205 



Leu Val Gly Asp Val Thr Asp Lys Val Cys He He Val Asp Asp Met 
210 215 220 



Ala Asp 
225 



Thr Cys Gly Thr 
230 



Leu Ala Lys Ala Ala Glu Val Leu Leu Glu 
235 240 



His Asn Ala Arg Ser Val 
245 



He Ala He Val Thr His Gly He Leu Ser 
250 255 



Gly Lys 
Cys Thr 



Ala He Glu Asn 
260 

Asn Thr Val Pro 
275 



He Asn Asn Ser Lys Leu Asp Arg Val Val 
265 * 270 

Phe Glu Glu Lys Met Lys Leu Cys Pro Lys 
280 ~ 285 



Leu Asp Val He Asp He Ser Ala Val Leu Ala Glu Ser He Arg Arg 
290 * 295 300 



Leu His Asn Gly Glu Ser lie Ser Tyr Leu Phe Lys Asn Asn Pro Leu 
305 1 310 315 320 



<210> 14 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 14 

gatgctagag accgcggggt gcaac 25 



<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 15 

tgtccgccat gtcgtctaca ataata 26 



<210> 16 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 16 

atatcttgat gaagacgttc accgt 25 



<210> 17 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 17 

gataatgacg gcttggccgg gaaga 25 



<210> 18 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 



<400> 18 



ggcatcaacc tcgaggaggc gaacc 



<210> 19 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 19 

cagaccggcc tcgaccagca tcgcc 



<210> 20 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 20 

tggaccgggc ggtgttcgag ttggg 



<210> 21 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 21 

aggctggatc ctggctgcct cgcgc 



